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CUSTOMER PREFERNCE SYSTEM 



PBIORTTY INFORMATION 

5 The present application claims all applicable priority and other legal benefit based on 

the following application: Provisional U.S. patent application number 60/269,419 entitled 
"Method and System for Matching Products with Interests and Demographic Profiles." 



FIELD OF THE INVENTION 

10 The present invention is directed to user opinion systems, and more particularly to 

customer preference systems for predicting customer opinions about products and attributes 
ofproducts. 



BACKGROUND OF THE INVENTION 
15 Conventionally, information about customer preferences, predilections, behavior, 

opinions and buying habits has been informally collected, maintained and analyzed by certain 
businesspeople, such as salespeople and marketing experts. More recently, customer 
preference information has increasing been collected, maintained and analyzed by computer. 
The field of computer based customer preference technology is relatively new having 
20 emerged firom recent advances in the ability of computers to efficiently handle large amounts 
of information. The computers can better collect, organize and store large amounts of 
information about customer behavior. The computers can also do complex or data-intensive 
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forms of statistical analysis on large consumer behavior databases to predict customer desires 
or future purchases. 

There are several different conventional techniques that been used to make predictions 
about user opinions (e.g., customer preferences): 
5 1. Collaborative Filtering. To predict a subject user's opinion, the user is 

compared with other users in a user opinion database to determine which other users tend to 
hold similar opinions. The known opinions of some or all of the similar users are used as a 
basis for predicting opinions of the subject user. 

2. Content Based Systems. To predict a subject user's opinions (e.g., 

1 0 preferences) about a topic (e.g., a product), known opinions for the user about attributes of the 
product, from the user opinion database, are used as a basis for predicting the users opinions. 
For example if it is known that a user is highly partial to the color red, is highly partial to 
cotton clothing and is highly partial to sweaters, then acontent based user opinion system 
would predict that the user would be highly partial to a red, cotton sweater because this 

1 5 product is made up of attributes about which the users holds favorable opinions. 

3. Expert Based or Rule Based Systems. To predict a subject user's opinions, 
predetermined rules are supplied (eg., by a knowledge expert). While expert based systems 
may use customer databases as input information, expert systems do not depend on observed 
data to process the inputs to make predictions. Rather, the expert's knowledge and rules 

20 control this information processing. As such, expert based systems are subject to limitations 
on the knowledge and predictive powers of human experts. 
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The foregoing three types of user preference systems may (ormay not) employ 
mathematical or statistical concepts of: (1) fazxy logic, and (2) trainable vector functions. 
These concepts are briefly explained below. 

1 . Fuz2y Logic. A formal system of logic in which numbers on a scale from 0 to 
5 1 are used instead of the values "tme" and "false" as absolutes, to accurately represent the fact 

that some questions do not have a simple yes or no answer. Many issues, including many 
expressions of various customer preferences, are amenable to fiizzy logic type analysis. 

2. Trainable Vector Function. This is a technique that can be used in conjunction 
with the various types of user opinion systems enumerated above. In using a trainable vector 

10 function, some form of predicate information is used to "predict" other known information 
present in the database. For example, the predicate information may take the form of known 
customer opinions regarding product attributes, or predicted user opinions regarding user 
attributes. He trainable vector function "prediction" is made without resort to the known 
information that is being predicted based on the predicate information. If the trainable vector 

1 5 function predictions of the known information tend to be very close to the actual known 

values of the known information, then it may be inferred that the prediction algorithm works 
well and will also work well to predict unknown information using the prediction edgorithm 
and similar predicate information inputs. On the other hand, if flie trainable linear vector 
function predictions do not conespond well with the predetermined, known opinions, then the 

20 algorithm may be discarded or modified until it yields good "predictions" of predetermined, 
known opinion data. A neural network is one example of implementation of a trainable 
vector function. 
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Some specific patent publications in the area of customer preference systems will now 
be discussed. 

U.S. patent 4,870,579 ("Hey") discloses a customer preference system wherein a 
selected user's reaction to an unsampled item is predicted baaed upon the reactions of other 
5 users who have sampled the item. The data from the other users is weighted depending upon 
an agreement scalar, which is intended to reflect the degree of respective similarity between 
the selected user and each of tiie other users. 

U.S. patent 5,704,017 ("Heckerman et al.") discloses a system and method for 
predicting user preferences. In Heckerman et al., data in a user database is used to create and 

1 0 test a plurality of competing belief networks. The belief network that is determined to be best 
is then employed as part of a collaborative filtering algorithm. This collaborative filtering 
algorithm, including the best belief network, is repeatedly employed to predict user opinions. 
Periodically, as new information is added to the user database, the user database may again 
be used to create and compare a new set of competing belief networks. Again, the belief 

1 5 network adjudged to be best is used to effectively make a new collaborative filtering 

algorithm. The new collaborative filtering algorithm is then used to repeatedly determine 
unknown user opinions. 

One potential drawback of the Heckerman, et al. system is that the new belief 
networks are generated only periodically. This means that the resultant collaborative filtering 

20 algorithm may be unoptimal because new information may have been added to the user 
database since the collaborative filtering algorithm was last updated. Perhaps more 
fimdamentally, another potential drawback of the Heckerman et al. system is that the optimal 
belief network and resulting collaborative filtering algorithm is not optimized on a case by 
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case basis, but rather optimized for the user database as a whole. While the collaborative 
filtering algorithm deemed to be the best by the Heckennan, et al. system may be the best, on 
average, across a wide spectrum of users and known user information, in practice the amoimt 
and type of available user information will often vary firom user to user. However, 
Heckerman, et al. uses the same collaborative filtering algorithm, regardless of what 
information happens to available for a specific user. Finally, Heckerman, et al. only 
contemplates one type of algorithm: collaborative filtering (based on expert rules and/or 
known data). 

U.S. patent 4,870,579 ("Miller et al.") discloses a predictive system wherein various 
collaborative filtering algorithms are evaluated and compared against each other using 
receiver operating curves. In this way, a preferred collaborative filtering technique can be 
used for a given user opinion prediction problem. It is noted that Miller et al. is limited to 
collaborative filtering algorithms. It is further noted that Miller et al. does not combine the 
results of its various algorithms in making a prediction of a user opinion. Ratha:, Miller et al. 
attempts to use the single, best algorithm to make its predictions of user opinions. 

U.S. patent 6,101,484 ("Halbert, et al.") discloses the calculation of price demand 
curve based on known information. In Halbert e< al, the known information is primarily 
customer demand, expressed as binding maximum bids for products offered in a type of 
auction. 

To the extent that specific publications are discussed above, these discussions should 
not be taken as an admission that the discussed publications (e.g., patents) are prior art for 
patent law purposes. For example, some or all of tiie discussed publications may not be 



wo 02/079901 PCT/USOl/45175 
Atty Docket No.: 28749-2/P02 Page 6 of 54 Patent 

sufficiently early in time and/or sufficiently enabling so as to amount to prior art for patent 
law purposes. 



5 SUMMARY OF THE INVENTION 

Now some basic terminology will be discussed. First, a user opinion database (see 
definitions section) is used to mean any database where a large number of user opinions are 
stored, either directly or at least inferentially. In addition to user opinions, the user opinion 
database preferably store other types of information, such as user attributes, information 

10 regarding products and information regarding product attributes. Both the user opinions, and 
the other types of information will generally fall into one of two types: (1) known (or 
explicit) information; and (2) predicted (or estimated) information. 

One example of a user opinion database is a conventional customer preference 
database. In a customer preference database, the users are custjmers and potential users who 

1 5 actually or potentially purchase, rent or otherwise give value in exchange for products and/or 

services. There are other types of user opinion databases, such a pohtical opinion database, 
used to predict political opinions and opinions on candidates for public office. 

The opinions of the user opinion database may be known opinions and/or predicted 
opinions. Known opinions are obtained when there is direct information about a user's 

20 opinion regarding a topic (e.g., a product) or attribute, such as when a customer responds to a 

questionnaire asking for opinions regarding a product. Predicted opinions are opinions 
determined indirectly from other data in the user opinion database. Often, the prediction of 



wo 02/079901 PCTAJSOl/45175 
Atty Docket No.: 28749-2/P02 Page 7 of 54 Patent 

an opinion will be the primary use of the user opinion system. In other cases, a predicted 
opinion may be used to help predict other opinions. 

For example, a user's opinion about an attribute of a product may be used to help 
predict the user's overall opinion regarding the product The opinions regardmg products are 
5 herein referred to as product opinions. The opinions regarding attributes present in various 
products are herein referred to as attribute opinions. 

Now that the generic concept of a user opinion database has been estabUshed, the 
concept of a user will be discussed. Often a user will be an individual, such as a customer 
who purchases household supplies for a household. However, the user may be any other 

10 entity capable of having an opinion, such as: a family, acoiporation, a political jurisdiction, 
an artificially intelligent computer based being, an animal and so on. So long as an entity can 
somehow express some sort of opinion (e.g., a dog that prefers one type of dog food over 
another), it can be considered a user. 

In flie process of predicting user opinions based on the user opinion database, there are 

15 two types of users: (1) the subject user; and (2) other users. The subject user is the user who 
is the subject of the prediction. The other users are everybody else in the database. In the 
process of predicting user opinions based on the user opinion database, the opinion being 
predicted will be referred to as the subject opinion. Other opinions stored in the database will 
be referred to as other opinions. 

20 The opinions may take different forms depending on the associated issue. A yesor- 

no opinion would be an example of an opinion in binary form, where only two possible 
opinions are appropriate. Another form for opinions would be discrete opinions, wherethere 
are two or more possible opinions, but where the number of possible opinions is finite. An 
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example of an issue that woiild be associated with a discrete numerical opinion would be: 
rate Brand X widgets on a scale of 1 to 100. An example of an issue that would give rise to a 
discrete narrative opinion would be: choose the word that best describes Brand X widgets 
from the following choices: reUable, inexpensive or portable. While the inventive user 
opinion predictive concepts explained in this document may apply to both numerical and 
narrative opinions, numerical opinions, where feasible, are often preferable because they are 
more easily subject to statistical analysis and interpretation. 

At least some embodiments of the present invention may implicate the following 
inventive concepts: 

1 . Predicting a user opinion (eg., an attribute opinion prediction, a product 
opinion prediction) by combining the results from parallel algorithms for predicting user 
opinion. 

2. Predicting user opinions by combining results of different types of parallel 
algorithm. For example, instead of merely using parallel collaborative filtering algoritiuns, 
the predictions of one or more collaborative filtering algorithms may be combined with 
results form one or more content based algorithms and/or one or more expert rales based 
algorithms. 

3. Various novel ways of combining results from various, alternative predictive 
algorithms to form a single predictions. These various metiiods of combining predictions are 
sometimes herein referred to as combiners. 

4. A user decision model for determining product opinion for determining 
product opinions based on attribute opinions, and/or for determining attribute opinions based 
on product opinions. The user decision model preferably uses a trainable non-linear vector 
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function so that known product and attribute opinions of the subject user, or of other users, 
can help determine and optimize the parameters of various algorithms employed in the user 
decision model. The user decision model is preferably non-linear. For example, various 
relevant, known attribute opinions for a subject user may be differently weighted to unequally 
influence a predicted product opinion for the subject user. 

5 . Generation of a price versus demand curve based upon user opinion data (that 
is, known and/or predicted user opinion data). For example, user attribute opinions (known 
and/or predicted) can be used to predict a price versus demand curve for a product having the 
attributes. 

At least some embodiments of the present invention may exhibit one or more of the 
following objects, advantages and benefits: 

1 . More accurate opinion predictions based on a limited amoimt and/or types of 
known opinions and/or known facts available in the user opinion database. 

2. Better targeted advertising based on more accurate product opinion 
predictions. 

3. Better pricing policies, such as optimal market segmentation pricing schemes, 
based on more accurate product opinion predictions. 

4. Better product and/or service design based on more accurate attribute opinion 
predictions and/or better product opinion predictions. 

5 Increased business for parties, such as banks, who are able to offer their clients 
andyor customers more accurate customer opinion predictions. 

According to one aspect of the present invention, a method of making a prediction of 
an opinion of a predetermined user based on data in a user opinion database includes several 
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steps. One of the steps is determination of a plurality of predictions for the predetermined 
user based on a respective plurality of prediction algorithms applied to the data of the user 
opinion database. Another step is delivery of at least one prediction(s) of the plurahty of 
predictions in human readable form. 
5 According to a further aspect of the present invention, a method of determining the 

price for a product includes several steps. One step is provision of a customer opinion 
database including attribute value data and attribute opinion data. The attribute value data 
corresponds to the plurality of attributes and associated attribute values present in the product, 
with one of the attributes being the price of the product. The attribute opinion data is 

10 associated with at least some of the attributes in the list of attributes. Another step is 

provision of a product opinion algorithm for determining a product opinion regarding the 
product for a predetermined customer based on the data in the customer opinion database. 
Another step is determination of a plurality of product opinions for the predetermined 
customer using the product opinion algorithm wherein each product opinion is based upon 

1 5 assumption of a different attribute value for the price attribute. 

According to a further aspect of the present invention, a method of determining an 
estimate of a product opinion of a predetermined product for a predetermined customer 
includes several steps. One step is provision of a list of attributes and associated attribute 
values present in the predetermined product. Another step is provision of attribute opinion 

20 values for at least some of the attributes. Another step is calculation of an estimated product 
opinion value for the predetermined customer based on the attribute opinion values using a 
product opinion algorithm. 
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Further applicability of the present invention will become apparent from a review of 
the detailed description and accompanying drawings. It should be understood that the 
description and examples, while indicating preferred embodiments of the present invention, 
are not intended to limit the scope of the invention, and varioxis changes and modifications 
5 within the spirit and scope of the invention will become apparent to those skilled in the art 

BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention will become more fully understood from the detailed 
1 0 description given below, together with the accompanying drawings, which are given by way 

of illustration only, and are not to be construed as limiting the scope of the present invention. 
In the drawings: 

Fig. 1 is a schematic diagram of a customer preference system according to the 
present invention. 

1 5 Fig. 2 is another schematic diagram of the customer preference system of Fig. 1 . 

Fig. 3 is another schematic diagram of the customer preference system of Fig. 1 . 

Fig. 4 is another schematic diagram of the customer preference system of Fig. 1. 

Fig. 5 is another schematic diagram of the customer preference system of Fig. 1. 

Fig. 6 is another schematic diagram of the customer preference system of Fig. 1. 
20 Fig. 7 is another schematic diagram of the customer preference system of Fig. 1 . 

Fig. 8 is another schematic diagram of the customer preference system of Fig. 1. 

Fig. 9 is another schematic diagram of the customer preference system of Fig. 1. 

Fig. 10(a) is another schematic diagram of the customer preference system of Fig. 1. 
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Fig. 10(b) is another schematic diagram of the customer preference system of Fig. 1. 
Fig. 11 is another schematic diagram of the customer preference system of Fig. 1. 
Fig. 12 is another schematic diagram of the customer preference system of Fig. 1. 
Fig. 13 is another schematic diagram of the customer preference system of Fig. 1. 
Fig. 14 is another schematic diagram of the customer preference system of Fig. 1. 
Fig. 15 is another schematic diagram of the customer preference system of Fig. 1. 
Fig. 16 is another schematic diagram of the customer preference system of Fig. 1. 
Fig. 17 is another schematic diagram of the customer preference system of Fig. 1. 
Fig. 18 is another schematic diagram of the customer preference system of Fig. 1. 
Fig. 19 is another schematic diagram of the customer preference system of Fig. 1. 
Fig. 20 is another schematic diagram of the customer preference system of Fig. 1. 
Fig. 21 is another schematic diagram of the customer preference system of Fig. 1. 
Fig. 22 is another schematic diagram of the customer preference system of Fig. 1. 
Fig. 23 is another schematic diagram of the customer preference system of Fig. 1. 
Fig. 24 is another schematic diagram of the customer preference system of Fig. 1. 
Fig. 25 is another schematic diagram of the customer preference system of Fig. 1. 
Fig. 26 is another schematic diagram of the customer preference system of Fig. 1. 
Fig. 27 is another schematic diagram of the customer preference system of Fig. 1. 
Fig. 28 is another schematic diagram of the customer preference system of Fig. 1. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Before starting a description of the Figures, instructions for interpreting the words and 
phrases of this patent document will be provided. More particularly, many jurisdictions allow 
a patentee to act as its own lexicographer, and thereby allows the patentee to provide 
5 instructions in a patent docimient as to how the words, terms and phrases of the document are 
to be interpreted as a legal matter. For example, in flie United States, the prerogative of the 
patentee to act as its own lexicographer has been soUdly established based on statutory and 
case law. Accordingly, the following section provides rules for interpreting the words, terms 
and phrases of this particular patent document. 

10 

INTERPRETIVE RULES 

Rule 1: There is a "Specially Defined Terms" section set forth below. Only words, 
terms or phrases that are explicitly defined in the Specially Defined Terms are to be 
considered to have a special definition, and, of course, the explicit definition provided herein 

15 is to serve as the definition for these terms. Accordingly, sources such as the patent 
specification and extrinsic evidence shall not be used to help define these terms - the 
explicitly provided definitions shall control. 

Rule 2: If a word, term or phrase is not specially defined, then its definition shall be 
determined in the first instance by resort to dictionaries and technical lexicons that either exist 

20 as of the time this patent document is filed. (See definition of "dictionaries and technical 

lexicons" below in the Specially defined Terms section.) It is acknowledged that dictionaries 
and technical lexicons often provide alternative definitions. Also, definitions provided in 
different dictionaries and different lexicons often differ and are not always entirely consistent. 
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In that case, it must be decided which definition is in best accordance with this document. 
Rules 3 and 4, set forth below, provide some guidelines for chooang between alternative 
definitions for a word, term or phrase. 

Rule 3 : The role of the specification (other than the Specially Defined Terms section) 
5 as an interpretive or definitional aid shall be limited to helping choose between alternative 
definitions that meet the requirements of Rule 2 (above). 

Rule 4: The role of extrinsic evidence (e.g., expert witnesses) as an interpretive or 
definitional aid shall be limited to helping choose between alternative definitions that meet 
the requirements of Rule 2 (above). 

10 

SPECIALLY DEFINED TERMS 

the present invention: means at least some embodiments of flie present invention; 
references to various feature(s) of the "present invention" throughout this document do not 
mean that all claimed embodiments or methods include the referenced feature(s). 
15 dictionaries and/or technical lexicons: any document whose primary purpose is the 

definition of words, terms and/or phrases; on the other hand, documents fliat merely discuss, 
explain or provide examples of devices or methods, without purporting to provide definitions 
of specific words, phrases or terms, are not to be considered as dictionaries and/or technical 
lexicons. 

20 user: any entity capable of having an opinion; users include, but are not limited to, 

individuals, groups of individuals, business entities, artificial intelligence beings and social 
institutions. 
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product: any item involved in a commercial transaction; products include, but are not 
limited to, commodity products, non-commodity products, real estate, commodity services, 
professional services and intellectual property; commercial transactions include, but are not 
limited to, purchase, rental, barter and loan. 
5 price: amountofvaluepotentially and/or actually asked and/or received for a 

product; the value may be measured in monetary units or non-monetary units. 

combining: includes, but is not limited to, combining by static averaging, combining 
by determining a weighted average, and combining according to an algorithm determined 
using a trainable linear vector function; combining does not include the process of selecting a 
10 single thing from a group of things. 

trainable vector function: any function wherein a predictive algorithm is tested and/or 
modified based on the difference between known results and predictions of the known results 
obtained by the predictive algorithm. 

known opinion: an opinion directly provided by a user (e.g., known product opinion, 
15 known attribute opinion). 

predicted opinion: an opinion predicted based on other known and/or predicted 
information (e.g., predicted product opinion, predicted attribute opinion). 

To the extent that the definitions provided above are consistent with ordinary, plain 
and accustomed meanings (as generally evidenced, inter alia, by dictionaries and/or technical 
20 lexicons), the above definitions shall be considered supplemental in nature. To the extent that 
the definitions provided above are inconsistent with ordinary, plain and accustomed meanings 
(as generally evidenced, inter alia, by dictionaries and/or technical lexicons), the above 
definitions shall control. If the definitions provided above are broader than the ordinary. 
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plain and accustomed meanings in some aspect, then the above definitions will control at 
least in relation to their broader aspects. 

To the extent that a patentee may act as its own lexicographer under applicable law, it 
is hereby fiirther directed that all words appearing in the claims section, except for the above- 
5 defined words, shall take on their ordinary, plain and accustomed meanings (as generally 

evidenced, inter alia, by dictionaries and/or technical lexicons), and shall not be considered to 
be specially defined in this specification. Notwithstanding this limitation on the inference of 
"special definitions," the specification may be used to evidence the appropriate ordinary, 
plain and accustomed meanings (as generally evidenced, inter alia, by dictionaries and/or 
10 technical lexicons), in the situation where a word or term used in the claims has more than 
one alternative ordinary, plain and accustomed meaning and the specification is helpfiil in 
choosing between the alternatives. 

Customer Preference Systems 

1 5 The field of "customer preference" technology is a relatively new one, having 

emerged fi-om recent advances in computers' ability to efficiently store and organize large 
amounts of information describing customers' behavior and expressed opinions. More 
particularly, this new technology has been helped along by advances in computer network 
communication {e.g., the Internet) and large capacity data storage systems. The notion of 

20 "collaborative filtering" focuses on the idea that a vendor's past experience with past 

customers can be used to more quickly and accxuately predict the needs and desires of new 
potential customers. 

The core ability of a customer preference system is to produce a quantitative 
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prediction regarding a customer's opinion (eg., degree of like / dislike) about a product. To 
do tbis, the customer preference system uses a customer profile, representative of a customer, 
and a product profile, representative of a product The quantitative estimate of customer 
opinion is herein referred to as a predicted opinion. Generally speaking, there are ttiree 
5 distinct classes of predictive algorithms for use in customer preference systems: (1) expert 
based, (2) collaborative filtering, and (3) content based. Expert based systems are discussed 
above. Further remarks on collaborative filtering and content based algorithms follow. 

Collaborative Filtering 

10 There are a variety of possible methods for implementing this customer preference, 

the most common being collaborative filtering. Collaborative filtering utilizes a process of 
estimating the degree of similarity between a potential new customers and past customers. 
The assimiption is that the potential new customer's opinion, about a new product at issue, 
will tend to be similar to known opinions of similar past customers. For example, the known 

1 5 opiiuons may have been provided by past customers in response to a consumer survey. 
Clearly, there are several drawbacks to this approach. 

One of the most fimdamental limitations of this approach, is that it requires that the 
system has previously encoimtered a reasonable number of past customers who actually 
reasonably similar to the new potential customer. Also, the system should have 

20 comprehensive product ratings collected for those past customers. The absence of suitable 
past customers with relevant, known opinions, vidll lead the system to make predictions with 
poor accuracy. 
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One type of collaborative filtering algorithm, used to predict an opinion of a 
predetennined user, determines similarity between the predetermined user and other users 
based upon the degree of similarity in product opinions between the predetermined user and 
the other users. 

Another type of collaborative filtering algorithm, used to predict an opinion of a 
predetermined user, determines similarity between the predetermined user and other users 
based upon the degree of similarity in attribute opinions between the predetermined user and 
the other users. 

Another type of collaborative filtering algorithm, used to predict an opinion regarding 
a predetennined product, determines similarity between the predetennined product and other 
products based upon the degree of similarity in product opinions between the predetermined 
product and the other products. 

Content Based Customer Preference Systems 

One means of ckcxmiventing some of these problems arising firom data shortage about 
similar customers, is to use content based methods of predicting a product opinion. More 
particularly, a product can be considered to be made up of attributes and attribute values. For 
example, an attribute may be color and the attribute value may be red. In content based 
systems, the users known attribute opinions, relating to some or all attributes of a product, are 
used to predict a product opinion for the subject user. Attributes of a product might include 
its price, its color, the presence/absence of a feature, and so on. If the potential new 
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customer's attribute opinions regarding the attributes is known, or can be predicted, then these 
attribute opinions may be combined to predict Ihe product opinion. 

For example, content based prediction can be made by finding a measure of similarity 
between products, based on the products' constituent attributes and attribute values. In this 
case, the content based prediction would be based upon the subject user's known opinions for 
products determined as similar to the subject product Once again, sufficient predetermined 
data about the potential new customer's product opinions for similar products is required. 
Otherwise, the accuracy of the prediction will tend to be poor. 

One type of content based algorithm, used to predict an opinion of a predetermined 
user, determines similarity between the predetermined user and other users based upon the 
degree of similarity in attributes of the user between the predetermined user and other users. 
This term "attributes of the user" refers to factual data regarding the user, as opposed to the 
opinions held by the user. Such factual data may include characteristics such as age, gender, 
religion, hair color, weight and so on. 

Another type of content based algorithm, used to predict an opinion regarding a 
predetermined product, determines similarity between the predetermined product and other 
products based upon the degree of similarity in attributes of the product between the 
predetermined product and other products. These attributes of the product are actual features 
of the product, as distinct fi:om product opinions proffered by users regarding the product. 

User Decision Models 

One type of user decision model algorithm is herem called ARPA. ARPA is a static 
(that is, non-trainable) algorithm wherem a predicted product opinion for a predetermined 
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user is calculated. The predicted product opinion is calculated by collecting attribute 
opinions (known and/or predicted) for the predetermined user. Along with the attribute 
opinions, associated attribute weights are also collected. These attribute weights reflect the 
relative importance of each attribute to the user. Some attributes are more important than 
others. The ARPA user decision model algoritiun accounts for this fact in its use of attribute 
opinion weights. 

Under the ARPA algorithm, a product of each collected attribute opinion and its 
associated weight is calculated. These products are summed (and normalized, as necessary) 
to form the predicted product opinion for the user. 

Another user decision inodel is herein called PRAR. PRAR is a trainable user 
decision model. Under PRAR, there are two stages; (1) the training stage; and (2) the 
utilization stage. In the training stage, known attribute opinions and known product opinions 
for various users are provided. The known attribute opinions are combined in various ways 
to form product predictions. For example, tiie attributes may be ascribed various weights so 
that differently weighted averages of the attribute opinions yield different predicted product 
opinions. The predicted product opinions, each calculated in different ways, are compared to 
the knovm product opinions. 

In the subsequent utilization stage, the way of combining the attribute opinions that 
gives the most accurate product opinions in tiie training stage is chosen. This chosen 
algorithm is then used to combine attribute opinions when the product opinion is not known. 
Periodically (e.g., when a significant amount of known product opinion data is added to the 
user opinion database), the training stage may be revisited and a new chosen algorithm may 
be determined to best reflect the totality of the data m the user opinion database. 
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Parallel-Combmative Estimation 

Collaborative filtering type and content based type predictive algorithms have 
degraded performance with missing data. Expert based algorithms depend upon the 
5 (generally limited) knowledge of experts. However, each of these three types of algorithms 

depends on non-identical sets of data. Thus, when one method has insufficient data to 
operate eiEfectively, the other may have fully sufficient data available. By combining the 
predictions of various algorithms into a single prediction, the accuracy may be improved. 
This may be especially true when results from algorifcms of different types are combined. 

10 For example, the combination of both content based and collaborative filtering based 

algorithms may give especially accurate results because collaborative filtering algorithms and 
content based algorithms generally tend to depend on highly divergent data sets. 

This combination of predictions from independent algorithms to form a single 
prediction is herein called parallel-combinative estimation. Parallel-combinative estimation 

1 5 combines (see above definition of "combines") the results of various algorithms, rather than 
merely trying to select the best result from a set of parallel algorithms. While each algorithm 
result may weigh equally in the ultimate parallel-combinative prediction, it is usually 
preferable to weigh the results differently in the combination, depending on the degree of 
perceived accuracy of each separate algorithm. Some exemplary combiners, for combining 

20 predictive results, will be discussed below. 

Combiners 
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There are several different types of combiners for combining parallel predictions (e.g., 
predictions of user attribute opinions, predictions of user product opinions) into a single 
parallel-combinative prediction, including: (1) confidence weighting; (2) expert weighting 
(also sometimes called combination of experts weighting); and (3) trainable vector 
5 combinations. 

In confidence weighting, each of the parallel predictions is associated with a 
confidence value. These confidence values are used as weighting factors in calculating a 
weighted average of the various parallel estimates that are being combined. For content 
based estimates and collaborative filtering estimates, the confidence value is proportional to 
10 the similarity and the respective dimensions of the similarities. As shown in Figs. 12 and 15, 
the similarities include product rating similarity 238, product attribute similarity 240, product 
rating similarity 248, attribute rating similarity 250, attribute similarity 252. As shown in 
Figs. 12 and 15, the respective dimensions include product rating dimension 239, product 
attribute dimension 241 , product rating dimension 249, attribute rating dimension 251 and 
1 5 attribute dimension 253 . 

In expert weighting, the parallel estimates are combined by summing the multiple 
estimates weighted by expert's (e.g., domain expert's) assigned weights. 

In trainable vector combinations, parallel estimates are combined in various ways to 
predict known data. The method of combination that yield parallel-combinative predictions 
20 that best match the known data is deemed to be the best method of combination. This method 
of combination is then used to calculate parallel-combinative predictions where the data being 
predicted (e.g., a user opinion) is not knowa One type of trainable vector fimction that may 
be used as a combiner is a neural networL 
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Attribute Based Customer Preference Systems 

Another important possibility for creating content based estimates, is to find attribute 
opinion predictions for various attributes that may be present in one or more products of 
5 interest. The product opinion prediction, for a given product, may be created by combining 
known and/or predicted attribute opinions with respect to a subject user. An evident problem 
with such a process, is that different users will mentally asmbine attribute opinions into 
product opinions in a variety of ways. Some folks will only be concemed about finding a 
product having a single outstandingly-good attribute. Other folks may be more concemed 
10 about the balance of all the attributes, with all the attributes taking on a similar level of 
influence over the resulting product opinion. 

A simplifying assumption is the assumption that every product opinion is inherently 
expressed in the following equation: 

. Product Opinion Prediction = sum(Attribute Opinion) / number of Attributes (1) 
1 5 This equation (1) is known as the Linear Customer Decision Model. The underlying 
assumption that all attributes are equally important is not optimally accurate. 

For increased accuracy, a trainable nonlinear vector function approximator, such as a 
neural network, is used to leam how each individual customer combines attribute opmions 
into product opinions. Here the assumption is that a customer, or even groups of customers, 
20 will tend to combine attribute opinions into product opinions in similar ways. 

A furflier complication of trying to use attribute opinions to create product opinion 
predictions, is that there may not be enough predetermmed data regarding the attribute 
opinions. This may be reconciled somewhat by methods of predicting attribute opinions. 
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The attribute opinion predictions may be used as a substitute for predetennined attribute 
opinions (e.g., attribute opinions gleaned from customer surveys). One method of forming 
attribute opinion predictions for a subject user is to use attoibute opinions of other users who 
happen to be similar to the potential new customer. This is analogous to the collaborative 
5 filtering method of forming product opinions by using predetermined product opinions of 

other, similar users. 

Another way of predicting attribute opinions is to infer attribute opinion predictions 
from predetermined product opmions, where the products include the attribute of interest. 
While this inference process (termed "attribution") is usually under-constrained, and thus has 

10 intrinsic descriptive error, it can be used still sometimes provide meaningful attribute opinion 
predictions, even in the absence of similar, past customers. Attribution works from the 
Linear Customer Decision Model, stipulating that if several, predetermined, known product 
opinions are available for a subject, then the tiiese equations define a set of simultaneous 
equations that can be roughly solved to find predicted attribute opinions. The predicted 

1 5 attribute opinions can be used to help form a more rigorous basis for making new product 

opinion predictions for the subject user. 

Unorthodox Applications 

There exist a variety of ways of determining similarity, so as to produce more 
20 independent predicted opinions, both predicted attribute opinions and predicted product 

opinions. These methods have sHghtly differing data dependencies, and thus can be used in 
parallel (that is combination or selection) to cover each other's weaknesses when data is 
missing. 
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One objective of the present invention is provision of a descriptive and detailed model 
("user decision model") of how users form opinions. The user decision model primarily deals 
with the way in which attribute opinions influence associated product opinions for a user, and 
may additionally deal with how product opinions may be used to glean meaningfiil 
5 information about attribute opinions. Such a user model is especially advantageous for use in 
other personalized retail applications. 

A user decision model for a subject user preferably includes known attribute opinions 
and predicted attribute opinions. The user decision model models flie subject user's 
subjective, mental method of combining attribute opinions into product opinions. 

10 Developing a good user decision model can be useful for finer-grained interaction with the 
subject user {e.g., potential customer). For example, the user decision model may be used to 
determine the most appealing attributes of a given product, so as to create advertisements 
whose content is tailored to the subject user. 

Alternatively, such a fine-grained user decision model could be utilized to estimate 

15 the price-demand curve of the subject user. By providing reasonably accurate price-demand 

curves for all potential customers for a product, 100% market segmentation in pricing may be 
achieved. Such complete market segmentation is conventionally attempted in the selling of 
used cars, wherein each potential customer is offered a different price for the same product, 
based on that potential customer's willingness/ability to pay for the product, and based on the 

20 perceived appeal of the product to the potential customer. Conventionally, this attempted 

market segmentation pricing does not always work well in practice because of deficiencies in 
information about the potential customer and Ms opinions. The present invention improves 
on this situation by increased collection of data (e.g., using data in a large database), more 
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completely applying tiie available data (ag., using statistically based algoritbms) and better 
leveraging the available data (eg., parallel-combinative estimation). 

External Dependencies 

5 As depicted in Figure 1, the preferred embodiment of the system 1000 depends on a 

database 4 to store large amounts of persistent data (detailed later). This embodiment also 
interacts with the user by way of a User Interface 1, which communicates to the System 3 by 
means of an Internet Connection 2. 

10 Use Cases 

The interaction between the User 5 and the System 3, takes place in several possible 
modes. For example, as shown in Figure 2, a User 5 supplies the System with his own User 
Id 2-1, and a Product Id 2-1, whereupon the System 3 responds to the User 5 with a predicted 
Customer Product Rating 2-2 conesponding to flie User Id 2-1 and Product Id 2-1. 

1 5 Another style of interaction between User 5 and System 3, shown in Figure 3, requires 

that the User 5 supply the System 3 with a User Id 3-1, and a Product Category Id 3-1. The 
System 3 can then respond by producing an ordered list of the Product Ids 3-2 fliat are 
predicted to have the highest Customer Product Rating for the User. 

As shown in Fig. 4, another use case for the System 3 occurs when a User 5 enters a 

20 Product Id 4-1, and the System 3 delivers an ordered hst of User Ids 4-2, which s predicted to 
have the highest Customer Product Rating for that Product Id 4-1 . All three use cases utilize 
the same underlying functionality of creating a prediction of Customer Product Rating for 
each pairing of Customer and Product. 
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The use case Figure 5 employs an adjunct to the main system, the Ad System 5, which 
selects a product to advertise, sends the selected User Id 5-2 and Product Id 5-3 to the main 
System 3, and then receives the resultiiig rating-sorted list of Attributes 5-4 for Hhe selected 
product. The Ad System 5 then converts the rated attributes into a displayable Advertisement 
5 5-5 for delivery to the User 5. 

In the final use case, the Producer of a specific Product will select a User 5 (most 
likely a group of individuals), and for that User 6-2 and Product 6-3, the System 3 will 
generate a PriceDemandCurve 6-4 for the User 5, consisting of the User's Rating for the 
Product at different price levels. As shown Figure 6, the DemandSystem 7 may then 

1 0 determine a discounted product price 6-5 for User 5, which can be conveyed to the User as an 
advertisement, or upon demand. 

The present invention allows a user opinion database to be used to generate price 
demand curves, or other price-related predictions, such as: calculating the probability of a 
sale of a product to a given customer at a given price or prepare product inventory based on 

1 5 expected sales. By comparing pricing calculations based on differing sets of product 

attributes, a designer may determine what combination of features to include in a newly- 
designed product or service to maximize profitability on the new product or service. While 
attempts to predict pricing and market behavior based upon alternative pricing schemes are 
not new, under the present invention, this price-related predictive work is based upon 

20 predicted user opinions, rather than facts. For example, pricing predictions may be 

conventionally based on customers expressing demand for a specific product with pre-orders 
or the like. 
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However, under the present invention, these price-related predictions based upon 
known information (e.g., known user opinions, known user demand) may be additionally or 
alternatively based upon predicted information. This predicted information, used to make 
price-related predictions, may include predicted attribute opinions of various users, predicted 
product opinions of various users and predicted user attributes. Preferred methods of 
predicting product opinions, attribute opmions and user attributes are discussed throughout 
this document. 

Demand curve data is different for each combination of user {e.g., individual user, 
group of individuals) and product The demand curve predicts different product opinions 
given different product attributes, which may include or be the price. Thus a price demand 
curve may be created for each individual by the present invention and may be helpful even 
when the product is real estate. For example, suppose that a real estate seller desires to 
propose a possible price for a piece of real estate to a user. The demand system can be used 
to generate a price demand curve for the specific user and then determine a price range that 
corresponds to soimd product opinion. The proposed price may then be based on flie price 
range to maximize the probability and profitability of the transaction for the seller. 



Software Object Design - Main System- Object Containment 

As shown in Figure 7, the Main System 3 is composed of a multitude of data tables, 
which are permanently stored in the aforementioned Database 4. Portions of each table are 
loaded into the Computer's RAM for more rapid processing, diough the details of this process 
are not necessary in order to build and use the System 99. 
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As shown in Fig. 7, flie first group of tables, are the Participant Tables 18, which 
contain data describing the Customers and the Products. 

As shown in Fig. 8, each row 301 of the Users Table 10 describes one individual 
Customer with information including userid 201, average product rating 202, num product 
5 ratings 203, average attribute rating 204, num attribute ratmgs 205, clusterid 206, decision 
mode neural net 207 and attributes 256. As shown in Fig. 8, the Users Table 10 contains 
columns such that each row of the table holds one User's User Id 201 (a number or string 
unique to each User), Average Product Rating 202, total number of Product Ratings (Num 
Product Ratings 203), Average Attribute Rating 204, total number of Attribute Ratings (Num 
10 Attribute Ratings 205), and a Cluster Id 206 used for grouping the Users. Also, the Users 
Table 301 row entry has a field for storing a serialized Neural Net 207 (or other vector 
function module - used in compositing a user's Attribute Ratings to Product Ratings). 

As shown m Fig. 9, attribute ratings table 16 has rows 302 including userid 201, 
attribute name 208, attribute value 209, explicit rating 210, estimate 211, estimate certainty 
15 212, estimate fcom user similarity using explicit ratings 213, estimate certainty fi:om user 
similarity using explicit ratings 214, estimate from user similarity using attributed ratings 
215, estimate certainty from user similarity using attributed ratings 216, estimate from 
attributed product ratings 217 and estimate certainty from attributed product ratings 218. 

As shown in Fig. 10(a), further Product data is stored in the Product Attributes Table 
20 12a, wherein each row 303 describes an Attribute of a Product, including information such as 

attribute name 208, type 219, constraint 220 and representative values 221 . 

User Attributes Table 12b (shown in Fig. 7 and shown in more detail m Fig. 10(b))is 
used in calculating user-to-user similarity based on the users' attributes, such as age, career, 
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income and so on. As shown in Fig. lOCb), each row 303b of user attribute table 12b includes 
attribute name 208b, type 219b, constraint 220b and representative values 221b. 

As shown in Fig. 1 1, product ratings table 17 has rows 304 including userid 201, 
productid 222, explicit rating 223, estimate 224, estimate certainty 225, estimate fiom user 
5 similarity from product ratings 226, estimate certainty from user similarity from product 

ratings 227, estimate from attribute ratings 228, estimate certainty from attribute ratings 229, 
estimate from user similarity from attribute ratings 230, estimate certainty from user 
similarity from attribute ratings 231, estimate from product similarity from product ratings 
232, estimate certainty from product similarity from product ratings 233, estimate from 
10 product similarity from product attributes 234, estimate certainty from product similarity 
from product attributes 235, estimate from attribute ratings by attribute similarity 236, 
estimate certainty from attribute ratings by attribute similarity 237, estimate from user 
similarity from user attributes 258 and estimate from user similarity from user attributes 259. 
As shown in Fig. 12, product similarity table 14 has rows 305 including productid_l 
15 222, product_id2 222, product rating similarity 238, product rating dimension 239, product 

attribute similarity 240 and product attribute dimension 241. 

As shown in Fig. 13, each row 306of the Products Table 1 1 partially describes a 
Product, including information such as productid 222, average product rating 242, num 
product ratings 243, nearest neighbors 244, categoiy 245 and attributes 246. 
20 As shown in Fig. 14, users cluster table 1 5 has rows 307 including clusterid 247. 

As shown in Fig. 15, user similarity table 13 has rows 308 including userid_l 201, 
userid_2 201, product rating similarity 248, product rating dimension 249, attribute rating 
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similarity 250, attribute rating dimension 251, attribute similarity 252 and attribute dimension 
253. 

Software Object Design - Supplement Systems - Object Containment 

The diagram shown in Figure 16 illustrates the components comprising the 
Personalized Advertisement System 6, including a data table of Ad Fragments 21 (stored in a 
database), and an Ad Fragment Combiner 22 module, which will assemble selected Ad 
Fragments from the Ad Fragment Table 21 into a composite ad for delivery to the user. 

The Demand Curve Generation System 7, shown in Fig. 17, shows that the Demand 
Subsystem 7 is composed of just a pair of essential data tables -the Products Table 1 7 
(shared with the main system) and the Purchase History Table 23. This pair of tables jointly 
stores the information used for predicting the number of customer purchases that would be 
made at a variety of price levels. 

Normal Operations 

As shown in Figures 18 and 19, information in the System 3 moves fr^om the 
Participant Tables 18, to the Similarity Tables 19, and finally to the Rating Tables 20. The 
first stage is to fill the Similarity Tables 19 based on statistical algorithms using data from the 
Product Ratings Table 17, Attribute Ratings Table 16, Products Table 1 1 and users Table 10 
(see Figs. 18 and 20). 

As shown in Figure 20, the second major step is to derive the Attribute Ratings Table 
16 from die User Similarity Table 13, the Product Ratings Table 17, and the Attribute Ratings 
Table 16. Next, as shown in Figure 21, the Product Ratings Table 17 is filled with a variety 
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of Estimates of the Product Ratings, using the data fiom the User Similarity Table 13, the 
Product Similarity Table 14, the Product Ratings Table 17, and the Attribute Ratings Table 
16. Finally, the variety of Product Rating Estimates is combined to produce a final Product 
Rating Estimate224. 

In the case where Personalized Advertisements are created (refer to Figure 5), the Ad 
System 6 takes in the Attribute Ratings 5-4 from the Main System 3, selects matching Ad 
Fragments from the Ad Fragment Table 21, and composites those Fragments by way of the 
Ad Fragment Combiner 22 module. The operations of the Ad Fragment Combiner 22 are 
described below in more detail. 

As shown in Fig. 22, during the application of the System 3 for Demand Curve 
Generation, the set of estimated Product Ratings 22-1 at various Price levels (given by the 
MainSystem 3), is combined with the Purchase History 22-2 data in the Demand System 7. 
This combination produces a set of Estimated Quantities 22-3 of Product purchased at various 
Price levels. Those purchase quantity estimates are processed by the Demand System 7 to 
determine the most profitable Price Level 22-4, which is then output. 

Set Comparisons and Combinations 

Throughout the discussion of this particular implementation of the system, the phrase 
"Set Similarity Methods" refers to two sets or vectors or numbers, which can include (but 
should not be constmed as limited to): covariance, Euclidean distance, hamming distance 
and cosine distance. Sets of numbers can be combined in a variety of manners, including 
averaging, weighted averaging (where the weights are usually some confidence measure), 
selecting the mode, median, or other statistical summary of the data set" 
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Filling the SimilaiityTables 

In order to fill the Product Similarity 14 table, a couple different statistics for every 
5 possible combination of products in the system must be provided. For example, the Product 
Similarity 240 is based on Similarity of Product Attributes 246. This calculation requires that 
calculation of the Similarity between two vectors of mixed numbers and discrete values (as 
described in the preceding section Set Comparisons. In this implementation example, for 
each pair of Products, the normalized Euclidean distance, between the two sets of Product 
1 0 Attributes 246 corresponding to each Product 222 in the pair, is calculated. The reciprocal of 
this distance is entered in the Product Similarity Table 14 field Product Attribute Similarity 
240. The number of Product Attributes 246 present for both products in the pair is entered as 
the Product Attribute Dimension241 . These common Attributes are used in calculating the 
Product Attribute Similarity 240. 
1 5 Likewise, for each pair of Products 222, the Product Similarity Table 14 field Product 

Rating Covariance Similarity 238 is set to a Set Comparison measure (covariance). This Set 
Comparison measure reflects the Similarity between the sets of Product Ratings 223 (firom 
the Product Ratings 17 table) corresponding to each Product 222 in the pair. The Product 
Rating Covariance Dunension 239 is set to the number of Product Ratings 223, which is non- 
20 null for both Products 222 in the pair. 

In the User Similarity Table 13, for each pair of Users 201, the Product Rating 
SiiEdlarity 248 is set to a Set Comparison measure of the Similarity (covariance) of each 
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user's set of Product Ratings 223 . The Product Rating Dimension 249 is set to the number of 
corresponding pairs of Product Ratings present for both Users. 

The field Attribute Rating Similarity 250 is set to a Set Comparison measure of the 
similarity (covariance) of each of the pair's user's set of Attribute Ratings 210. The Attribute 
5 Rating Dimension 25 1 is set to the number of corresponding pairs of Attribute Ratings 210 
present for both Users 201 . 

The Attribute Similarity 252 field of the User Similarity Table 13 is set to the Set 
Comparison measure (Euclidean distance) of the sets of User Attributes 256 corresponding to 
each of the pair's Users 201, and again the Attribute Dimension 253 is set to the number of 
10 corresponding couples of User Attributes 256 present for both Users 201. The sets of User 
Attributes 256 are demographic profiles for the users. 

Filling the Attribute Ratings Table 

In the Attribute Ratings Table 16, each pairing of Attribute Name 208 and Attribute 

1 5 Value 209 may be rated explicitly by the User 5 by way of the User Interface 1 . This explicit 
rating is thus stored in the Explicit Rating 2 1 0 field of the Attribute Ratings Table 1 6. 
Alternatively, if no Explicit Rating 210 is supplied, the rating (or attribute opinion) can be 
predicted by the methods disclosed herein. These predicted Estimates of the Attribute Rating 
can be determined through a variety of processes, including the major processes of estimating 

20 fix)m User Similarity, and estimating fi-om Decision Model Attribution. 

Attribution 
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The Decision Model Attribution starts with the assumption of some function PR(AR), 
that predicts a user's Product Rating 223 for a given Product 222 from that user's Attribute 
Ratings 210 for the Attributes 208,209 corresponding to the given Product 222. In this 
example that model is a trainable Neural Network of relatively simple design. The Neural 
5 Network preferably includes a layer of sigmoid units matching the number of attributes, 

followed by a second layer holding a single sigmoid unit This Neural Net User Model 500 is 
then trained using the available data, such that the training inputs to the Learning Module 500 
are the known Attribute Ratings 210 for a certain Product 222, and the training ou^ut is the 
Ejqplicit Product Rating 223 for that Product 222. Furthermore, network 500 can be trained 

10 on individual explicit Attribute Ratings 210, whereby the training input is just that one 
Attribute Rating 210 (explicit, knovra) (other inputs are set to default "zero preference" 
values), and the trainmg output is that same Attribute Rating 210 (estimated, predicted). 

From this training, the objective is to realize a User Decision Model 500 that does a 
relatively smooth interpolation of the User's true internal Decision Model. In other words, 

15 the statistical User decision Model of the present invention should preferably reflect the way 
that various attributes and associated attribute opinions influence a user's opinion ofproduct, 
when those product opinions are formed (consciously or unconsciously) by the user out in the 
real world. Types of user decision models, other than a Neural Net of sigmoid units, are also 
workable. For example, a Linear Decision Model, essentially averages the Attribute Ratings 

20 210 into a Product Rating 224. These other types of user decision models can be trained just 
as described for the neural network model. 

To construct an Estimated Attribute Rating 21 1, the Attribution process proceeds by 
finding an Attribute Rating Estimate 211 for a certain Attribute Name 208 and Attribute 
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Value 209. The Estimate2 1 1 minimizes the sum of differences between each Product Rating 
223 and the output of the User Decision Model 500 when all known Attribute Ratings 2 1 0 
and the Estimate 211 are input into the User Decision Model 500. Product Ratings 223 
correspondmg to Products 222 not possessing the Attribute Name 208 and Attribute Value 
5 209 are preferably ignored in this process. This minimization can be done in a variety of 
ways (e.g.. gradient descent, binary search, discrete approximation), though maximal 
accuracy is not required. 

We also form a Confidence Measure 212. The Confidence Measure 212 is non- 
interchangeable with other Confidence Measures, which are based on different scales. 
10 Confidence Measure 212 is the normalized error (e.g. Root Mean Squared Enor) of the 

Attribute Rating Estimates 211 (eg., the sum of each difference between the Explicit Product 
Rating 223 and the Product Rating Estimate resulting firom putting the Attribute Rating 
Estimate 211 into the User Decision Model 500). This Confidence Measure 212 is used later 
in combining various parallel Attribute Rating Estimates and Product Rating Estimates. 

15 

Case-Based Estimation 

In the rather more traditional style of Collaborative Filtering, we may also create 
Attribute Rating Estimates 213 for a subject User 201 based on the Attribute Rating 
Estimates 21 1 of other Similar Users 201, using any of the Similarity Measures in the User 
20 Similarity Table 13. To do this, an Attribute Rating Estimate 213 for a given User 201, 

Atbibute Name 208, and Attribute Value 209, is created based on a similarity-weighted Set 
Combination (in our case, the weighted average) of the Explicit Attribute Ratings 210 of 
other Users 201 for the same Attribute Name 208, and AttributeValue 209. Thus an Attribute 
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Rating Estimate 213 may be predicted based on Users 201 who are Similar by the Product 
Rating Similarity 248 ( from the User Similarity Table 13), Attribute Rating Similarity 250, 
or User Attribute Similarity252. A Confidence Measure 214 (non-interchangeable) can also 
be determined. This Confidence Measure 214 is proportional to the User Similarity and the 
5 User Similarity Dimension of the Users 201 whose Attribute Ratings 210 were used in 
constructing the Attribute Rating Estimate 213. 

Combining Attribute Rating Estimates 

Although, at this point, it might seem helpful to combine the various types of 

10 Attribute Rating Estimates (e.g.,. firom Attribution, and fmm Case Based Estimation) for each 
User 201, Attribute Name 208 and AttributeValue 209, it is advantageous to postpone this 
step. Instead, the various Attribute Rating Estimates are maintained to create a corresponding 
variety of Product Rating Estimates based on those Attribute Rating Estimates. The relative 
merits of the various Product Rating Estimates can be learned by the Product Rating Estimate 

1 5 Combiner 501 (e.g.. Neural Network). The inputs to and outputs fix»m Product Rating 
Estimate Combiner 501 are shown in Fig. 28. 

However, one modification of the Attribute Rating Estimates is made at this point, in 
that any available Explicit Attribute Rating 210 is used to replace the corresponding Attribute 
Rating Estimates. The modified Attribute Rating Estimates' Certainty Measures, which are 

20 known attribute opinions, are set to maximum confidence values, because these values are 
known rather than predicted. 



Filling the Product Ratings Table 
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In the Product Rating Table 17, each User 201 may rate each Product 222, though the 
fraction of Products rated by any particular User 201 is usually quite small. The Product 
Ratings 223, which are missing, will be estimated (or predicted) in this stage. Corresponding 
to the entries in the Product Rating Table 17, the estimation methods used in this 
5 implementation include: 

Product Rating Estimate from User Similarity from Product Ratings 226 
Product Rating Estimate from User Similarity from Attribute Ratings 230 
Product Rating Estimate from User Similarity from User Attributes 258 
Product Rating Estimate from Product Similarity from Product Ratings 232 
1 0 Product Rating Estimate from Product Similarity from Product Attributes 234 

Product Rating Estimate from Attribute Ratings 228 

Product Rating Estimate from Attribute Ratings Similarity to Product Attributes 236 

Product Rating Estimate from UserSimilarity from ProductRatings 226 
1 5 This is the most traditional Collaborative Filtering method. This process predicts a 

Product Rating Estimate from the User Similarity-weighted Set Combination (e.g., weighted 
average) of the other Users' Explicit Product Ratings 223 for the given Product. In this 
implementation, the other Users' Product Ratings 223 are weighted by Product Rating 
Similarity 248 from flie User Similarity Table 13, and by the Confidence Measure (Product 
20 Rating Dimension 249) of this User Similarity. A Confidence Measure for this Estimate is 
formed as the weighted Confidence Measure of the Similarities 248 used in forming the 
Estimate 226, where the weights of the partial Confidence Measures are the same weights as 
those used to weight the Product Ratings 223. 
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Product Rating Estimate from User Similarity from Attribute Ratings 230 

This method is the same as the previous process, except that it uses the User 
Similarity measure Attribute Rating Similarity 250, and flie Similarity Confidence Measure 
Attribute Rating Dimension 251. 

Product Rating Estimate from User Similarity from User Attributes 258 

This method is the same as the previous process, except that it uses the User 
Similarity measure Attribute Similarity 252, and the Similarity Confidence Measure Attribute 
Dimension 253. 

Product Rating Estimate from Product Similarity from ProductElatings 232 

Analogous to the traditional Collaborative Filtering method, this process predicts a 
Product Rating Estimate for a given User 201 and Product 222 based on the Product Ratings 
223 of other Similar Products 222, as found in the Product Similarity Table 17. Thusly, the 
Product Rating Estimate 232 is determined as the Product Similarity-weighted Set 
Combination (weighted average) of the other Products' ExpUcit Product Ratings 223 by the 
subject User201. This method uses weights that are the product of the Similarity 
ProductRatingSinularity238 and the Certainty Measure Product Rating Dimension 239. A 
Confidence Measure 233 for this Estimate 232 is formed as the weighted Confidence 
Measures of the Similarities used in forming the Estimate 232. The weights of the partial 
Confidence Measures are the same weights as those used to weight the Product Ratings 223. 
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Product Rating Estimate from Product Similarity from Product Attributes 234 

This method is the same as the previous process, except fliat it uses the Product" 
Attribute Similarity 240 (as a Product Similarity measure) and the Product Attribute 
Dimension 241 (as a Confidence Measure). 

Product Rating Estimate from Attribute Ratings 228 

This Product Rating Estimate is predicted by taking the Attribute Rating Estimates 
21 1 for the subject User 201 for the Attributes 208 of the given Product 222. These are input 
into the User Decision Model 500, to create a Product Rating Estimate 228, as shown in 
Figure 23. The Confidence Measure 229 for this Estimate 228 is formed as the CM 
weighted Set Combination (average) of the Confidence Measiures 212 of the component 
Attribute Rating Estimates 211. 

Product Rating Estimate from Attribute Ratings Similarity to Product Attributes 236 

This Product Rating Estimate is created as the Attribute SimUarity-weighted Set 
Combination (weighted average) of the Attribute Ratings 210 for the given User 201, which 
correspond to the Attributes 246 of the given Product 222. The Confidence Measure 237 for 
this Estimate is formed as the Confidence Measure-weighted Set Combination (average) of 
the Confidence Measures of the component Attribute Rating Estimates 21 1. 

Personalized Ads 

The process of creating Personalized Advertisements 5-5 from the underlying Main 
System 3 for Recommendations, begins when the Ad System 6 module selects a User and a 
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Product. The Personalized Ad 5-5 is to be generated based on this User / Product input data. 
This selection is passed to the Main System 3, which returns the set of Attribute Rating 
Estimates 5-4 for that User for the Attributes of the given Product 222. The Ad Systan 6 
then selects the Attributes 208 for the given Product 222, which correspond to the Attribute 
5 Rating Estimates 211 witii the highest estimated Utility for the User (the number selected is a 
parameter of the Ad System 6). 

Once Attributes 208 are selected, the Ad System collects the set of Ad Fragments 
which correspond to those Attributes 208 in the Ad Fragment Table21 (Figure 24) and these 
fragments are subsequently combined into a composite Advertisement 5-5 in the Ad 

1 0 Fragment Combiner 22 module. 

The Ad Fragment Combiner's 22 methodology would depend on the types of Ad 
Fragments being combined, and on the desired output format of the Advertisements 5-5. In 
the case where the Ad Fragments are text strings and the desired output format is a text string, 
the combination process may work by simple concatenation. In the case where the Ad 

1 5 Fragments are text strings and the desired output format is an image (like a "banner" 

advertisement), the text strings may be written, in a panelized fashion, contiguously onto a 
background bitmap to make the image. In the case where the Ad Fragments are images, and 
the desired output format is an image, the Ad Fragments may be super-imposed contiguously 
on a background image, and then delivered. 

20 

Demand Curve Generation 

In order to generate Demand Curve data, the Demand System 7 must first select a 
User 5, 6-2, 201 and Product 6-3, 222 for which the data is to be generated. The Demand 
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System 7 must also select the Attribute Name 208 of an Attribute 246 belonging to the 
selected Product 222, so that this Attribute 209 will be varied to generate the Demand Curve 
data. 

It is noteworthy that this selected Attribute 209 does not have to be the Price of the 
5 Product 222, but can in fact be any of the Product's Attributes 246 -fliough Price is 
considered to be the most generally useful, and is the subject of this example. 

The Main System 3 generates a set of Product Rating Estimates 6-4 from Attribute 
Ratings 21 1, by cycling the selected Attribute 209 through a range of represenbtive Attribute 
Values. On each cycle, the Attribute Rating Estimates 21 1 corresponding to the Product's 
10 Attribute Values 246 and the cycled Attribute Value, are input into a Product Rating Estimate 
Method ( 228, 232, and/or 236 ), whose output is a Product Rating Estimate. The set of 
Product Rating Estimates 6-4 corresponding to the varied set of Attribute Values is sent back 
to the Demand System 7. 

As shown in Figures 22 and 25, for each pair of Attribute Value and Product Rating 
1 5 Estimate in 22-1 , the Demand System 7 selects data from the Purchase History Table 23 that 

corresponds to the given Product 222, and Product Rating Estimate 211. In this way, a set of 
Purchase Quantity Estimates 22-3 are created that correspond to each of the Attribute Values. 
The Demand System 7 may then use some internal rules to determine which combination of 
Attribute Value (ag.. Price) and Purchase Quantity Estimate will generate the most desirable 
20 outcome (e.g.. maximum net profit). The Demand System 7 may then provide this data to 

another system (e.g. the Personalized Ad System 6) which then delivers a corresponding offer 
to the User 5. 
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Prediction of Facts 

Much of the present invention deals with the prediction of user opinions, rattier than 
facts. This is because facts, such as age or gender, are usually more readily ascertainable than 
opinions, such as like or dislike of a product or attribute of a product However, facts may 
5 also be predicted using the system of the present invention. 

For example, consider a user whose age is not known. Assume that it is known that 
the user has a height of 3 feet 6 inches. Assume that this subject user of unknown age has 
explicitly provided some of her opinions. These known opinions indicate a strong liking for 
toys, cookies and fingerpainting. It would be expected that an algorithm that looked in a 
10 large database for other users (of known ages) with similar opinions and a similar height, the 
algorithm would find that other users with similar likes would be about 5 years of age. 
Therefore, the algorithm could predict a feet (user's age) based on other information (known 
facts and known opinions in this case). 

Of course, many algorithms, known now and/or later to be developed, could be used 
15 to predict facts. However, it is important to note that the prediction of facts using user 

databases is not conventionally common, because it is generally opinions, rather than facts, 
that are predicted by user databases. 

Predictions Based On Predicted Information 
20 Conventionally, predictions (or estimates) are based on known (or explicit) 

information. According to the present invention, predictions may be based, in whole or in 
part, upon other predictions. For example, in a preceding example, it was predicted that a 
subject user was 5 years of age. Suppose that the subject user's opinion on the famous novel 
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"War and Peace" needed to be predicted. The predicted age (5 years old) of the user would 
provide a strong indication that the user would not like the novel "War and Peace" because 
very few 5 year olds enjoy this long and difficult novel. In this case, a predicted opinion 
would be based, at least in part, on a predicted fact. Again, many different algorithms (or a 
combination of parallel algorithms) could be used to do the actual prediction, but the point is 
that the predicted fact may provide a very strong indication upon which to base an accurate 
predicted opinion. 

Predicted opinions, as well as predicted facts, can also be used to make predictions. 
In a preceding example, a subject user was assumed to have known favorable opinions 
regarding toys, cookies and kindergarten classes. In a slight variation, assume that there was 
no known opinion for the subject user regarding fingerpainting, but it was known that the user 
possessed a fingerpainting set Based on this known fact (along with other available, relevant 
information), it might be predicted that the subject user had a favorable opinion of 
fingerpainting. This predicted opinion on fingerpainting could then be used with the other 
known information (height, opinion on toys, opinion on cookies) to help predict the age of the 
user, as in the preceding example. In tiiis way, a predicted opinion can help predict other 
facts and/or other opinions. 

Conclusion 

Many variations on the above-described user opinion systems are possible. Such 
variations are not to be regarded as a departure from the spirit and scope of the invention, but 
rather as subject matter intended to be encompassed within the scope of the following claims, 
to the fullest extent allowed by applicable law. 
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1 . A method of making a prediction of an opinion of a predetemiined user based on 
data in a user opinion database, the method comprising the steps of: 

determining a plurality of predictions for the predetermined user based on a respective 
pluraUty of prediction algorithms applied to the data of the user opinion database; and 

combining the pluraUty of predictions into a single parallel-combinative prediction. 

2. The method of claim 1 wherein the pluraUty of prediction algorithms include 
algorithms of at least two of the following types: coUaborative filtering algorithms, content 
based algorithms and expert based algorithms. 

3. The method of claim 1 further comprising the steps of: 

determining, for each prediction of the plurality of predictions, a respective confidence 

value; 

wherein, in the combining step, the pluraUty of predictions are combined by calculating a 
confidence weighted parallel-combinative prediction based on the confidence values. 

4. The method of claim 1 wherein, in the combining step, the plurality of predictions 
are combined by an expert weighting algorithm. 
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5. The method of claim 1 wherein, in the combining step, the plurality of predictions 
are weighted by an algorithm that is determined through the use of a trainable vector function. 

6. The method of claim 5 wherein the trainable vector function is implemented by a 
neural network. 

7. The method of claim 1 wherein: 

the user opinion database is a customer preference database comprising customer 
preferences regarding a plurality of products; and 

the predetermined user is a potential customer for at some of the plurality of products. 

8. The method of claim 7 wherein the at least one prediction(s) delivered at the 
delivering step is effectively delivered in the form of a targeted advertisement. 

9. The method of claim 1 wherein: 

the user opinion database is stored in a computer system; 
the combining step is performed by the computer system. 



10. A computer system comprising: 

a user opinion database comprising user opinion data stored in machine readable form; 
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prediction instructions comprising machine readable instructions for determining a 
plurality of predictions for a predetermined user based on a respective plurality of prediction 
algorithms applied to the user opinion data; and 

a combiner comprising machine readable instructions for combining the plurality of 
predictions into a single parallel-combinative prediction. 

1 1 . The computer system of claim 10 wherein the combiner combines the pluraUty of 
predictions by confidence weighting. 

12. The computer system of claim 10 wherein the combiner combines the plurality of 
predictions by expert weighting. 

1 3 . The computer system of claim 1 0 wherein the combiner combines the predictions 
based on an algorithm determined using a trainable vector function. 

14. The computer system of claim 1 3 wherein the trainable vector fimction is 
implemented by a neural network. 

15. A method of determining product price-demand information, the method 
comprising the steps of: 

providing a user opinion database including predicted information; and 
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predicting a relationship between price and demand using at least some of the predicted 
information. 

16. The method of claim 1 5 wherein: 

the predicted information is a predicted product opinion of at least one user. 

1 7. The method of claim 1 5 wherein: 

the predicted information is a predicted attribute opinion of at least one user. 

1 8 . The method of claim 1 5 wherein: 

the predicted information is a predicted user attribute of at least one user. 

19. The method of claim 15 wherein the predicted information is based upon a 
combination of predictions from a plurality of parallel predictive algorithm. 

20. The method of claim 15 further comprising the step of proposing a price for a 
product based upon the results of the predicting step. 

2 1 . The method of claim 20 wherein the proposed price would maximize profits 
under the predictions of the predicting step. 
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22. A method of determining a prediction of a product opinion of a predetermined 
product for a predetermined customer, the method comprising the steps of: 

providing a list of attributes present in the predetermined product; 
providing attribute opinions for at least some of the attributes; and 
calculating a predicted product opinion for the predetermined customer based on the 
attribute opinions using a product opinion algorithm. 

23 . The method of claim 22 wherein the attribute opinions comprise known attribute 
opinions. 

24. The method of claim 22 wherein the attribute opinions comprise predicted 
attribute opinions. 

25. The method of claim 22 wherein the attribute opinions comprise attribute opinions 
for the predetermined customer. 

26. The method of claim 22 wherein the attribute opinions comprise attribute opinions 
for customers other than the predetermined customer. 

27. The method of claim 22 wherein the calculating step comprises the following sub- 
steps: 



wo 02/079901 PCTAJSOl/45175 
Atty Docket No.: 28749-2/P02 Page 50 of 54 Patent 

determining an attribute opinion for the predetermined customer for each attribute of list 
of attributes; and 

compositing the attribute opinions for the predetermined customer to determine the 
product opinion prediction. 

28. The method of claim 27 wherein the compositing step composits the attribute 
opinions by taking the arithmetic mean of the estimated attribute opinions and using this 
arithmetic mean as the product opinion prediction. 

29. The method of claim 27 wherein the compositing step comprises the sub-steps of: 
ascribing respective weights to each attribute opinion; 

calculating a weighted average of the attribute opinions to determine the product opinion 
prediction. 

30. The method of claim 29 wherein the ascribing step comprises the sub-step of: 
determining the weights based upon the relationship between attribute opinions for the 

predetermined customer and product opinions for the predetermined customer, with the product 
opinions being product opinions provided by the predetermined customer for products other than 
the predetermined product. 



3 1 . The method of claim 29 wherein the ascribing step comprises the sub-step of: 
determining the weights using a trainable vector function. 
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32. A method of determining a prediction of an attribute opinion, of a predetermined 
attribute included in at least one product, for a predetermined customer, the method comprising 
the steps of; 

providing a Ust of products that include the predetermined attribute; 

providing product opinions for at least some of the at least one products that include the 
predetermined attribute; and 

calculating a predicted attribute opinion for the predetermined customer based on the 
product opinions using an attribute opinion algorithm. 

33. The method of claim 32 wherein the product opinions comprise known product 
opinions. 

34. The method of claim 32 wherein the product opinion values comprise predicted 
product opinions. 

35. The method of claim 32 wherein the product opinions comprise product opinions 
for the predetermined customer. 

36. The method of claim 32 wherein the product opinions comprise product opinions 
for customers other than the predetermined customer. 
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37. The method of claim 32 wherein the calculating step comprises the following sub- 
steps; 

collecting all available product opinions for the predetermined customer for products that 
include the predetermined attribute; and 

decompositing the product opinions for the predetermined customer to determine the 
attribute opinion prediction. 

38. A method of using a user opinion database to make a prediction, the method 
including the steps of: 

providing a user opinion database with known information; 

determining a first prediction using a first algorithm based on the known information; 
determining a second prediction using a second algorithm based at least in part on the 
furst prediction. 

39. The method of claim 38 where the first algorithm is the same as the second 
algorithm. 

40. The method of claim 3 8 wherein the first prediction is a predicted fact. 



41 . The method of claim 3 8 wherein the second prediction is a predicted fact. 
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42. The method of claim 38 wherein the first prediction is a predicted opinion. 



43. The method of claim 38 wherein the second prediction is a predicted opinion. 



End of claims section - space below this line is intentionally blank. 
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